Discriminative Slot Detection Using Kernel Methods

Authors

  • Shubin Zhao
  • Adam Meyers
  • Ralph Grishman
Abstract

Most traditional information extraction approaches are generative models that assume events appear in text in certain patterns and that these patterns can be regenerated in various ways. These assumptions limit the syntactic clues considered for finding an event and confine the approaches to a particular syntactic level. This paper presents a discriminative framework based on kernel SVMs that takes into account different levels of syntactic information and automatically identifies the appropriate clues. Kernels are used to represent certain levels of syntactic structure and can be combined in principled ways as input for an SVM. We show that by combining a low-level sequence kernel with a high-level kernel on a GLARF dependency graph, the new approach outperforms a good rule-based system on slot filler detection for MUC-6.
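As a rough illustration of what combining kernels "in principled ways" can look like, the sketch below sums two normalized kernels with non-negative weights, which yields another valid kernel. It is not the paper's implementation: the n-gram sequence kernel, the edge-overlap dependency kernel, the `weight` parameter, the example sentences, and the edge labels (`OBJ`, `PRED`) are all assumptions made for illustration, and real GLARF graphs are considerably richer.

```python
from collections import Counter
from math import sqrt


def ngram_counts(tokens, n=2):
    """Count contiguous token n-grams (hypothetical low-level features)."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def sequence_kernel(a, b, n=2):
    """Low-level kernel: dot product of n-gram count vectors."""
    ca, cb = ngram_counts(a["tokens"], n), ngram_counts(b["tokens"], n)
    return float(sum(ca[g] * cb[g] for g in ca if g in cb))


def dependency_kernel(a, b):
    """High-level kernel: number of shared (head, relation, dependent) edges."""
    return float(len(set(a["edges"]) & set(b["edges"])))


def normalized(kernel):
    """Cosine-normalize a kernel so non-empty examples have self-similarity 1."""
    def k(a, b):
        denom = sqrt(kernel(a, a) * kernel(b, b))
        return kernel(a, b) / denom if denom > 0 else 0.0
    return k


def combined_kernel(a, b, weight=0.5):
    """Convex combination of two normalized kernels (still a valid kernel)."""
    seq = normalized(sequence_kernel)(a, b)
    dep = normalized(dependency_kernel)(a, b)
    return weight * seq + (1.0 - weight) * dep


def gram_matrix(examples, kernel):
    """Pairwise kernel values for a list of examples."""
    return [[kernel(x, y) for y in examples] for x in examples]


# Toy examples; tokens and edge labels are invented for illustration only.
examples = [
    {"tokens": ["Smith", "was", "named", "president"],
     "edges": [("named", "OBJ", "Smith"), ("named", "PRED", "president")]},
    {"tokens": ["Jones", "was", "named", "chairman"],
     "edges": [("named", "OBJ", "Jones"), ("named", "PRED", "chairman")]},
    {"tokens": ["Smith", "was", "named", "chairman"],
     "edges": [("named", "OBJ", "Smith"), ("named", "PRED", "chairman")]},
]

gram = gram_matrix(examples, combined_kernel)
```

A Gram matrix like `gram` could then be handed to any SVM implementation that accepts precomputed kernels, for example scikit-learn's `SVC(kernel="precomputed")`.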


Similar resources

Remote homology detection based on oligomer distances

Motivation: Remote homology detection is among the most intensively researched problems in bioinformatics. Currently discriminative approaches, especially kernel-based methods, provide the most accurate results. However, kernel methods also show several drawbacks: in many cases prediction of new sequences is computationally expensive, often kernels lack an interpretable model for analysis of cha...


Features Extraction For Protein Homology Detection Using Hidden Markov Models Combining Scores

A few years back, Jaakkola and Haussler published a method of combining generative and discriminative approaches for detecting protein homologies. The method was a variant of support vector machines using a new kernel function called the Fisher kernel. They begin by training a generative hidden Markov model for a protein family. Then, using the model, they derive a vector of features called Fisher sc...


Evaluation of Cardiovascular Disease Risk in the China Kadoorie Biobank Using Novelty Detection

We evaluate the risks of cardiovascular disease to the Chinese population by i) detecting "abnormality" using 3 one-class classification methods (a discriminative one-class support vector machine (SVM), a generative kernel density estimate (KDE), and a discriminative KDE), and ii) predicting probabilities of "normality", arrhythmia, and ischemia using 3-class classification method (a discriminat...


Alignmentfreie Analyse von Proteinsequenzen mit Verfahren des maschinellen Lernens

Motivation: Remote homology detection is among the most intensively researched problems in bioinformatics. Currently discriminative approaches, especially kernel-based methods, provide the most accurate results. However, kernel methods also show several drawbacks: in many cases prediction of new sequences is computationally expensive, often kernels lack an interpretable model for analysis of char...


The Spectrum Kernel: A String Kernel for SVM Protein Classification

We introduce a new sequence-similarity kernel, the spectrum kernel, for use with support vector machines (SVMs) in a discriminative approach to the protein classification problem. Our kernel is conceptually simple and efficient to compute and, in experiments on the SCOP database, performs well in comparison with state-of-the-art methods for homology detection. Moreover, our method produces an S...




Publication date: 2004